Feature Selection for Designing a Novel Differential Evolution Trained Radial Basis Function Network for Classification

Authors

  • Ch. Sanjeev Kumar Dash
  • Aditya Prakash Dash
  • Satchidananda Dehuri
  • Sung-Bae Cho
Abstract

This work presents a novel approach to classifying both balanced and unbalanced datasets by suitably tuning the parameters of radial basis function networks, with the additional cost of feature selection. Supplying an optimal and relevant set of features to a radial basis function network may greatly enhance its efficiency (in terms of accuracy) while also making it more compact. In this paper, the authors use information gain theory (a filter approach) to reduce the features and differential evolution to tune the centers and spreads of the radial basis functions. The proposed approach is validated on a few benchmark datasets, both highly skewed and balanced, retrieved from the University of California, Irvine (UCI) repository. The experimental results encourage further extensive research on highly skewed data.
DOI: 10.4018/jamc.2013010103. International Journal of Applied Metaheuristic Computing, 4(1), 32-49, January-March 2013. Copyright © 2013, IGI Global.
... if the whole dataset is used. Many variants of evolutionary and non-evolutionary approaches are discussed in Derrac, Garcia, and Herrera (2010). The ideal outcome of instance selection is a model-independent, minimum sample of data that can accomplish tasks with little or no performance deterioration. However, in this work, we restrict ourselves to feature selection only. Feature selection can be broadly classified into two categories: (1) the filter approach, which depends on generic statistical measurements; and (2) the wrapper approach, which is based on the accuracy of a specific classifier (Aruna et al., 2012). In this work, feature selection is performed using an information gain (entropy) measure, with the goal of selecting a subset of features that preserves as much as possible of the relevant information found in the entire set of features.
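The filter step described above can be illustrated with a minimal sketch. It assumes discrete-valued features and ranks them by information gain, that is, the entropy of the class labels minus their expected entropy after splitting on a feature. The function names, the toy data, and the top-k selection rule are hypothetical choices made for illustration; the excerpt does not state how the authors discretize features or choose the cut-off.

```python
import math
from collections import Counter, defaultdict


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(feature_values, labels):
    """Label entropy minus expected label entropy after splitting on a discrete feature."""
    groups = defaultdict(list)
    for value, label in zip(feature_values, labels):
        groups[value].append(label)
    n = len(labels)
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def select_features(columns, labels, k):
    """Return the names of the k features with the highest information gain.

    `columns` maps a feature name to its list of values (hypothetical layout).
    """
    ranked = sorted(
        columns,
        key=lambda name: information_gain(columns[name], labels),
        reverse=True,
    )
    return ranked[:k]


# Toy data (invented for illustration): one feature determines the class,
# the other is independent of it.
columns = {
    "f_informative": ["a", "a", "b", "b"],
    "f_noise": ["x", "y", "x", "y"],
}
labels = [0, 0, 1, 1]
selected = select_features(columns, labels, k=1)
```

Because the score is computed without consulting any classifier, this is a filter method in the sense used above; a wrapper method would instead retrain the classifier for each candidate subset.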
After the relevant set of features is selected, the fine-tuned radial basis function network is modeled using differential evolution for classification. Over the past decade, radial basis function (RBF) networks have attracted a lot of interest in various domains (Haykin, 1994; Novakovic, 2011; Naveen, Ravi, Rao, & Chauhan, 2010; Liu, Mattila, & Lampinen, 2005). One reason is that they form a unifying link between function approximation, regularization, noisy interpolation, classification, and density estimation. It is also the case that training RBF networks is usually faster than training multi-layer perceptron networks. RBF network training usually proceeds in two steps. First, the basis function parameters (corresponding to hidden units) are determined by clustering. Second, the final-layer weights are determined by least squares, which reduces to solving a simple linear system. Thus, the first stage is an unsupervised method that is relatively fast, and the second stage requires the solution of a linear problem, which is also fast. One of the advantages of RBF neural networks, compared to multi-layer perceptron networks, is the possibility of choosing suitable parameters for the hidden-layer units without having to perform a non-linear optimization of the network parameters. However, selecting the appropriate number of basis functions remains a critical issue for RBF networks. The number of basis functions controls the complexity, and hence the generalization ability, of RBF networks. An RBF network with too few basis functions gives poor predictions on new data, i.e. poor generalization, since the model has limited flexibility. On the other hand, an RBF network with too many basis functions also yields poor generalization, since it is too flexible and fits the noise in the training data.
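The conventional two-step training described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' method: it uses plain k-means for the clustering stage, Gaussian basis functions with a single shared spread, no bias term, and a tiny ridge term (a hypothetical addition for numerical safety) when solving the normal equations. All function names and the toy data are invented for the example.

```python
import math


def kmeans(points, k, iters=20):
    """Stage 1: Lloyd's k-means. Centers start at the first k points
    (an arbitrary, deterministic choice made for this sketch)."""
    centers = [list(p) for p in points[:k]]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: math.dist(p, centers[c]))
            clusters[nearest].append(p)
        for j, members in enumerate(clusters):
            if members:  # keep the old center if a cluster is empty
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return centers


def design_row(x, centers, spread):
    """Gaussian basis activations for one input (no bias term, for brevity)."""
    return [math.exp(-math.dist(x, c) ** 2 / (2 * spread ** 2)) for c in centers]


def solve(A, b):
    """Gaussian elimination with partial pivoting for a small square system."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    w = [0.0] * n
    for i in reversed(range(n)):
        w[i] = (M[i][n] - sum(M[i][j] * w[j] for j in range(i + 1, n))) / M[i][i]
    return w


def fit_rbf(X, y, k, spread, ridge=1e-8):
    """Stage 2: output weights from the normal equations
    (Phi^T Phi + ridge * I) w = Phi^T y."""
    centers = kmeans(X, k)
    Phi = [design_row(x, centers, spread) for x in X]
    A = [[sum(row[i] * row[j] for row in Phi) + (ridge if i == j else 0.0)
          for j in range(k)] for i in range(k)]
    b = [sum(row[i] * t for row, t in zip(Phi, y)) for i in range(k)]
    return centers, solve(A, b)


def predict(x, centers, weights, spread):
    """Network output: weighted sum of the basis activations."""
    return sum(w * phi for w, phi in zip(weights, design_row(x, centers, spread)))


# Toy two-class data (invented): class 0 near (0, 0), class 1 near (5, 5).
X = [(0.0, 0.0), (5.0, 5.0), (0.2, 0.1), (5.1, 4.9), (0.1, 0.3), (4.8, 5.2)]
y = [0, 1, 0, 1, 0, 1]
centers, weights = fit_rbf(X, y, k=2, spread=1.0)
```

Thresholding `predict(...)` at 0.5 turns the output into a class label. Only the second stage uses the labels, which is why it reduces to a linear problem once the centers and spread are fixed.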
A small number of basis functions yields a high-bias, low-variance estimator, whereas a large number of basis functions yields a low-bias but high-variance estimator. The best generalization performance is obtained via a compromise between the conflicting requirements of reducing bias while simultaneously reducing variance. This trade-off highlights the importance of optimizing the complexity of the model in order to achieve the best generalization. However, choosing an optimal number of kernels is beyond the focus of this paper. In the training procedure of RBFNs, determining the centers and widths is of particular importance for improving the performance of the networks. There are many approaches along this line, each with its own merits and demerits (Storn & Price, 1995; Storn & Price, 1997; Price, Storn, & Lampinen, 2005). This paper discusses the use of differential evolution to reveal hidden centers and spreads. The motivation for using differential evolution (DE) over other EAs (Michalewicz, 1996), such as GAs (Goldberg, 1989), is that in DE candidate solutions are typically encoded as real-valued vectors, and the perturbation of solution vectors is based on the scaled difference of two randomly selected individuals of the current population. Unlike in GAs, the resulting step size and orientation during the perturbation process automatically adapt to the fitness function landscape. The justification for combining feature selection with classification is to reduce space and time requirements while improving accuracy. This article is set out as follows. Section 1 gives an overview of RBF networks, feature selection, and differential evolution.
In Section 2, the ...
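The DE perturbation described above, a real-valued vector moved by the scaled difference of two randomly selected individuals, can be sketched with the common DE/rand/1/bin scheme. This is a generic illustration, not the authors' configuration: the variant, the values of `F` and `CR`, the population size, and the sphere function used as a stand-in fitness are all assumptions. In the paper's setting each vector would instead encode the RBF centers and spreads, and the fitness would measure classification quality.

```python
import random


def differential_evolution(fitness, dim, bounds, pop_size=20, F=0.5, CR=0.9,
                           generations=100, seed=0):
    """Minimize `fitness` over real-valued vectors with DE/rand/1/bin."""
    rng = random.Random(seed)
    lo, hi = bounds
    pop = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(pop_size)]
    scores = [fitness(ind) for ind in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # Three distinct individuals, all different from the target i.
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            # Mutation: base vector plus the scaled difference of two others.
            mutant = [pop[a][d] + F * (pop[b][d] - pop[c][d]) for d in range(dim)]
            # Binomial crossover; j_rand guarantees at least one mutant gene.
            j_rand = rng.randrange(dim)
            trial = [mutant[d] if (rng.random() < CR or d == j_rand) else pop[i][d]
                     for d in range(dim)]
            # Greedy selection: the trial replaces the target only if no worse.
            trial_score = fitness(trial)
            if trial_score <= scores[i]:
                pop[i], scores[i] = trial, trial_score
    best = min(range(pop_size), key=scores.__getitem__)
    return pop[best], scores[best]


def sphere(v):
    """Stand-in fitness (sum of squares), minimized at the origin."""
    return sum(x * x for x in v)


best, best_score = differential_evolution(sphere, dim=3, bounds=(-5.0, 5.0))
```

Because the step is built from differences between current population members, its size shrinks as the population contracts around a promising region, which is the self-adapting behavior the text contrasts with GAs.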

Similar resources

Neural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten

Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications, including reading aids for the blind, bank cheques, and conversion of any handwritten document into structured text form. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten characte...

Novel Radial Basis Function Neural Networks based on Probabilistic Evolutionary and Gaussian Mixture Model for Satellites Optimum Selection

In this study, two novel learning algorithms have been applied on Radial Basis Function Neural Network (RBFNN) to approximate the functions with high non-linear order. The Probabilistic Evolutionary (PE) and Gaussian Mixture Model (GMM) techniques are proposed to significantly minimize the error functions. The main idea is concerning the various strategies to optimize the procedure of Gradient ...

DE+RBFNs based classification: A special attention to removal of inconsistency and irrelevant features

A novel approach for the classification of both balanced and imbalanced datasets is developed in this paper by integrating the best attributes of radial basis function networks and differential evolution. In addition, special attention is given to handling the problem of inconsistency and the removal of irrelevant features. Removing data inconsistency and inputting optimal and relevant set of featur...

Improving Accuracy of DGPS Correction Prediction in Position Domain using Radial Basis Function Neural Network Trained by PSO Algorithm

Differential Global Positioning System (DGPS) provides differential corrections for a GPS receiver in order to improve the navigation solution accuracy. DGPS position signals are accurate but update very slowly. Improving DGPS corrections prediction accuracy has received considerable attention in past decades. In this research work, the Neural Network (NN) based on the Gaussian Radial Basis Fun...

A Novel Radial Basis Function Networks Locally Tuned with Differential Evolution for Classification: An Application in Medical Science

The classification of diseases appears as one of the fundamental problems for a medical practitioner, which might be substantially improved by intelligent systems. The present work is aimed at designing in what way an intelligent system supporting medical decision can be developed by hybridizing radial basis function neural networks (RBFNs) and differential evolution (DE). To this extent, a two...

Journal title:
  • Int. J. of Applied Metaheuristic Computing

Volume 4, Issue 1

Pages 32-49

Publication date: 2013